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Abstract —Heterogeneous networks (HetNets) have been con¬ 
sidered as a promising architecture for upcoming 5G networks 
due to their high energy and spectrum efficiency. However, 
providing backhaul connectivity for all macro and pico base 
stations (BSs) in HetNets constitutes a significant share of 
infrastructure cost. Recently, the idea of flexible backhaul has 
drown a lot of attention both from industry and academia. 
Under this architecture, not all the pico BSs are connected to the 
backhaul, resulting in a significant reduction in the infrastructure 
costs. In this regard, pico BSs without backhaul connectivity 
need to communicate with their nearby BSs in order to have 
indirect accessibility to the backhaul. This makes the radio re¬ 
source management (RRM) in such networks more complex and 
challenging. In this paper, we address the problem of cross-layer 
RRM in HetNets with flexible backhaul. We formulate the RRM 
problem as a two timescale non-convex stochastic optimization 
problem which jointly optimizes flow control, routing control, 
interference mitigation and link scheduling in order to maximize 
a generic network utility. By exploiting the hidden convexity 
of this non-convex problem, we propose an iterative algorithm 
which is converges to the global optimal solution. The proposed 
solution has low complexity and requires low signalling and 
message passing among different nodes, which makes it scalable. 
Moreover, due to the proposed two-timescale design, it is robust 
to the backhaul signalling latency as well. Simulation results 
demonstrate the significant performance gain of the proposed 
solution over various baselines. 

Index Terms —Flexible backhaul, heterogeneous networks, 
cross-layer radio resource management, two timescale stochastic 
optimization. 

1. Introduction 

Heterogeneous networks (HetNets) have attracted a lot of 
attention as a promising approach for improving energy ef¬ 
ficiency while providing high capacity in future mobile access 
networks [?], [?]. In current HetNet design, many pico cells 
are deployed across the network, each with fixed high-capacity 
backhaul connectivity that guarantees the fulfilment of the 
capacity demand of each hotspot [?]. However, it suffers from 
high CAPEX and OPEX [?]. Because in order to guarantee 
the fulfilment of the capacity demands, the statically allocated 
resource and deployment should be highly excessive, which 
leads to low utilization efficiency when the traffic is low. It 
has been shown that due to the large number of pico BSs 
in future HetNet deployments, providing the fixed backhaul 
for all BSs will lead to significantly huge increase in the the 
network power consumption [?], [?]. 

In addition, future networks should provide better support of 
emergency communications as well as fast recovery, in case 
of unpredicted crash of access points. However, in a fixed 


backhaul deployment, the crash of one point needs physical 
replacement (e.g., using an emergency communication vehicle) 
which suffers from slow network recovery and low reliability. 

To overcome the aforementioned problems of fixed back¬ 
haul deployment, the idea of flexible backhaul is proposed. The 
concept behind fiexible backhaul is to fiexibly utilize any idle 
network resources to maximize end-to-end experience with 
minimal cost. The key features of fiexible backhaul technology 
can be identified as follows: 

• Flexible utilization of system resources: Flexibly uti¬ 
lizing any system resources including wired (e.g., optical 
fiber, cable), or wireless (e.g., microwave, idle nodes) 
and hybrid resources (e.g., combined optical fiber and 
idle nodes), in order to increase the resource utilization 
efficiency. 

• Dynamic resource scheduling: Fully exploiting the de¬ 
gree of freedom of network resources (in terms of time, 
frequency, space, power, etc.), in order to maximize the 
transmission capability of backhaul. 

• Dynamic network topology: Intelligently adjusting net¬ 
work topology and backhaul transmission strategies, in 
order to match the traffic varying and meet the transmis¬ 
sion/reliability requirements. 

Considering these features, the expected benefits of fiexible 
backhaul can be categorized in four different aspects: 

1) Cost: Lower CAPEX and OPEX will be gained through 
aggregation and reusing of idle system resources. 
Mainly, under fiexible backhaul architecture, not all of 
the pico BSs are connected to the backhaul, resulting in 
a significant reduction in the infrastructure costs. 

2) End-to-end experience: Flexible backhaul provides bet¬ 
ter user experience through fiexible resource allocation. 

3) Reliability: Intelligent topology establishment would 
bring higher system reliability. For example, crash of 
an access point will not lead to unreachability to the 
backhaul, since it can dynamically reroute the traffic to 
other access points. 

4) Interoperability: It also provides easier inter-radio ac¬ 
cess technology (inter-RAT) interoperability as data can 
be routed through different RATs under fiexible back¬ 
haul. 

To achieve the above benefits, a proper dynamic RRM 
scheme should be proposed. However, RRM for fiexible 
backhaul is much more challenging than traditional RRM 
for HetNets with fixed backhaul. In HetNets with fiexible 
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backhaul, the data flows associated with a BS without backhaul 
connectivity may ride on the idle backhaul resources of other 
BSs to reach to the backhaul network. Hence, some BSs 
can relay for the data flows of that BS. Accordingly, multi¬ 
hop routing will be required among different BSs. This leads 
to higher complexity in resource allocation as well as more 
complex signalling and message passing that need to be 
efficiently dealt with. Furthermore, relaying other BSs’ data 
flows by a BS may increase the interference in the HetNet that 
should be addressed as well. Accordingly, the existing RRM 
schemes for HetNets with flxed backhaul cannot be applied to 
HetNets with flexible backhaul. 

On the other hand, most existing works on RRM in HetNets, 
only consider short-term instantaneous CSI adaptation or long¬ 
term statistical CSI adaptation. However, in most practical 
cases, a mixed timescale CSI knowledge is available in the 
HetNet: The local instantaneous CSI knowledge is available 
at each BS, while the global long-term statistical CSI knowl¬ 
edge of the network can be available at a central network 
controller. Therefore, in order to fully utilize the available 
mixed timescale CSI knowledge, a two timescale hierarchical 
RRM should be considered in the HetNet. 

Furthermore, the existing works on two timescale RRM 
for HetNets are mostly based on heuristic approaches [?], 
[?], i.e., the RRM solution is not derived from a single 
optimization problem. In [?] authors formulate a two-timescale 
hierarchical RRM problem for HetNets with enhanced inter¬ 
cell interference coordination and propose an asymptotically 
optimal solution for high SNRs. Yet, none of these works 
consider multi-hop routing which is an intrinsic characteristic 
of HetNets with flexible backhaul. 

In this paper, we focus on the problem of dynamic resource 
control for heterogeneous networks with flexible backhaul. We 
model the problem as a two timescale stochastic optimization 
problem and propose a two timescale control structure where 
the long-term controls are adaptive to the large scale fading, 
and the short-term control is adaptive to the local channel 
state information (CSI) within a pico or macro BS. In such 
a hierarchical RRM design, the long-term controls can be 
implemented centrally on an RRM server (RRMS), and the 
short-term controls can be updated locally at each pico or 
macro BS. 

In general, the problem of two timescale hierarchical RRM 
in HetNets with flexible backhaul is highly non-trivial and 
there are some challenges that need to be tackled properly. 

• Non-convexity: The overall radio resource optimization 
problem is non-convex. Hence, the conventional convex 
optimization techniques cannot be used to solve the 
problem. 

• Lack of Closed-form Expectation: The stochastic op¬ 
timization problem involves expectation operation in the 
constraints related to the average data rate of the links, 
which do not have closed form expression. 

• Mixed Timescale and Combinatorial Optimization 
Problem: Due to the mixed timescale RRM structure, the 
problem belongs to two timescale stochastic optimization 
problem where the optimization variables change at dif¬ 
ferent timescales. Moreover, the DTX control and links 


scheduling control variables are combinatorial variables. 
Therefore, the current stochastic optimization techniques 
such as stochastic cutting plane cannot be applied in such 
problem. 

• Complex Coupling between long-term and short-term 
control variables: Since the average data rate constraint 
of the links involve both long-term and short-term control 
variables, there is a strong coupling between these control 
variables. Therefore, the short-term and long-term control 
variables cannot be solved independently. 

To address the above challenges, we first apply the primal- 
dual decomposition method to decouple the optimization prob¬ 
lem into two subproblems, inner and outer problems.The inner 
problem involves data flow control and routing control, while 
the outer problem involves long-term interference mitigation 
among BSs and short-term link scheduling control. The inner 
problem is convex and can be solved by standard convex 
optimization methods. On the other hand, the outer problem 
is non-convex and involves combinatorial optimization. Using 
a hidden convexity in the outer problem, we propose an 
iterative algorithm to And its global optimal solution. Finally, 
we simulate and compare our proposed solution with various 
baselines to illustrate the significant performance gain of our 
proposed solution. 

H. System Model 
A. Heterogeneous Network Topology 

Consider the downlink of a two-tier multi-cell heteroge¬ 
neous network, as illustrated in Fig. 1. Within each cell, 
there are one macro BS, several pico BSs and several MUs. 
Moreover, there exists a radio resource management server 
(RRMS) in the network which coordinates the resource allo¬ 
cation among BSs. All the BSs are connected to and controlled 
by this central RRMS via a low cost signalling backhaul (e.g., 
X2 interface in LTE [?]). The gray lines between RRMS 
and each BS indicates the control plane, which represents the 
signalling interface between macro/pico BSs and RRMS. 

In order to reduce the backhaul cost, only the macro BSs 
and a portion of the pico BSs are connected to the high speed 
payload backhaul. The other pico BSs do not have direct 
access to the backhaul and hence need to communicate with 
the other BSs in order to reach to the backhaul. It is assumed 
that the set of BSs with backhaul connection is known. 

There are K data flows that are to be routed from some 
source BSs, which have connection to the backhaul, to some 
destined MUs in the network. BSs need to communicate with 
each other to transfer these flows from the sources to the end- 
users in a multi-hop mode. In this regards, some BSs need to 
relay the data of other source BSs in order to help their data 
flows reach to the associated mobile users. 

The total available bandwidth is divided into M subbands 
which are shared by the BSs for the data transmission (BS 
to MU or BS to BS transmission). The HetNet topology is 
represented by a topology graph defined below. 

Definition 1. (HetNet Topology Graph) Define the topology 
graph of the HetNet as a directed graph G = {A/", C} where 
M is the set of all BSs and MUs and C is the set of all 
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Fig. 1: A two-tier HetNet with macro and pico BSs. The 
signalling interface has been shown by the grey lines between 
RRMS and each BS. Note that the payload backhaul is not 
sketched in the figure for clarity. 



Fig. 2: The topology graph of the example HetNet of Fig. 1 
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directed edges (BS-to-BS or BS-to-MU links). Each edge I G C 
is a directed link connecting its head node to its tail node, 
and is associated with its CSI label where hi^rn 

represents the channel coefficient between the head and tail 
nodes of the link on subband m. 

The set of edges that are outgoing from node n is denoted 
by T{n). Note that since the coverage area of the macro BS 
includes the whole cell, there is direct link between the macro 
BS and each MU and hence, all the MUs are also associated 
to the macro BS as well. Moreover, let = jA/'l and L = \C\ 
be the number of nodes and links in the network topology 
graph. 

The topology of the network can be summarized using its 
node-link incidence matrix G, defined as below. 

Definition 2. (Node-link Incidence Matrix) The Node-link 
incidence matrix G for the HetNet is a N x L matrix, with 
a row for each node and a column for each link, in which its 
element is associated with node n and link I and is 
given by: 

{ 1 if n is the head node of link I, 

— 1 if n is the tail node of link I, (1) 
0 otherwise. 


Fig. 3: Superframe and subframe structure. 

Using this topology graph, it is easy to derive the node-link 
incidence matrix of the network, accordingly. 

It can be verified that each column has exactly one 1 and one 
-1 elements, and its other elements are all zero. For example, 
consider the 5^^ column which is associated with link 5. This 
column has one 1 in the first row and one —1 in the fourth row. 
It indicates that link 5 connects node one (which is the macro 
BS) to the node four (which is a pico BS) in the network. 

The time domain is divided into time slots of fixed length 
called superframe as illustrated in Fig. 3. Each superframe 
consists of Ts subframes. There are M resource blocks (RBs) 
in each subframe t which are associated with M subbands. 
Each resource block is indexed by (t, m), where t determines 
the subframe number and m is the subband of that RB. 
Eor clarifying, we refer to superframe, subframe and RB by 
different indices i, t and respectively. 

As in many standard channel models, two-timescale fading 
model has been assumed for the wireless channels between 
different nodes (BS-to-BS or BS-to-MU channels). Accord¬ 
ingly, the channel fading coefficient is given by the following 
formula 


According to the definition of the node-link incidence 
matrix of the network, each column corresponding to link I 
has one -\-l and one —1 entries in the rows corresponding to 
its head and tail nodes, respectively; while all its other entries 
are 0. Moreover, each row of the aforementioned matrix is 
associated with one node in the topology graph of the network. 
Hence, for each BSn, we define as its associated row in 
the node-link incident matrix G. 

The corresponding topology graph for the example HetNet 
of Eig. 1 has been illustrated in Eig. 2. It consists of = 10 
nodes and L = 17 directed edges, representing BSs/MUs and 
links between them, respectively. Note that as we are consider¬ 
ing downlink transmission, the directed links from MUs to BSs 
have not been included in this graph. Moreover, it can be seen 
from this topology graph that T(l) = {1, 2,4, 5, 7} , T(2) = 
{12,13,14,16} , r(3) = {3,8,18}, r(4) = {6,9,10,11} 
and r(5) = {15,17}. 


VI,m (2) 

where the small scale fading remains con¬ 

stant in each subframe and changes over different subframes. 
On the other hand, the large scale fading process > 0 

is usually caused by path loss and shadow fading, and is 
assumed to be a slow ergodic process, i.e., it remains constant 
for several superframes. 

B. Two Timescale Hierarchical Radio Resource Control Vari¬ 
ables 

The radio resource management control variables are cat¬ 
egorized into two groups called long-term and short-term 
control variables. The long-term control variables are deter¬ 
mined centrally at the RRMS in the longer timescale (e.g., in 
each superframe) and are adaptive to the large scale fading 
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process _ ^j^iarge^ Other hand, the short¬ 

term control variables are adaptive to the instantaneous CSI 
H = {hfZ^^ V/,m} and determined locally at each BS in 
the shorter timescale (e.g., in each subframe). 

a) Long-term Control Variables 

1) Flow Control: As mentioned before, there are K data 
flows that are to be routed in the downlink of the HetNet. We 
define flow control vector, d = [di,..., in which each 
element dk indicates the average allowed traffic rate for the 

data fiow. 

As the fiow control is an end-to-end control variable in 
the network, it should be determined centrally at the RRMS. 
Moreover, as they are average values, they do not need to be 
adapted to the short-term realizations of the channels. Hence, 
it is more appropriate to adapt them to the long-term statistical 
information of the channels, i.e., Therefore, fiow 

control is categorized as a long-term control variable, which 
is centrally determined at RRMS. 

2) Routing Control: As the destined users may not be in 
the coverage area of their source BSs, the other BSs should 
help in between in order to provide an accessible routing path 
between the source and destination. 

For generality, we assume multi-path routing is allowed 
in the network. It means that the traffic corresponding to 
each data fiow k can be split arbitrarily across multiple paths 
between the source and destination nodes in the network. As a 
result, each link I may carry some part of the data fiow dk,Vk. 
This is determined in the routing vector defined as following. 

For each data fiow k, the corresponding routing vector Xk = 
[xk,i ,..., ^ is defined in which, each element Xk,i 

indicates the average carried traffic of demand k over link 
1. Hence, the overall ATL-dimensional routing vector can be 
defined as x = [xj ,..., x^Y. 

The routing vectors {x^^k = l,...,Ar} are adaptive to 
global network topology which is a function of the mobility of 
MUs and hence does not change in short timescale. Therefore 
these control variables are regarded as long-term control vari¬ 
ables too and they can be determined according to statistical 
CSI in the longer timescale. Moreover, since they are end-to- 
end control variables, they are implemented centrally in RRM 
server. 

3) Disconnected Transmission (DTX) Control: As the 
macro BS covers all the other nodes (i.e., all pico BSs and 
MUs in the cell), it can cause strong interference to them. This 
is called cross-tier interference. Moreover, pico BSs may also 
suffer from co-tier interference caused by their neighbouring 
pico BSs. To solve these problems, we propose disconnected 
transmission or DTX to eliminate the interference in the 
HetNet. When a DTX is scheduled in a macro or pico BS, 
it will shut down the transmission on the current subframe. 
This eliminates the interference from this BS to the other 
BSs. Hence, scheduling DTX over time domain allows us to 
control interference. Specifically, the DTX pattern at subframe 
t is denoted by a{t) = [ai(t),..., a 7 v(f)] G A, where an{t) 
indicates whether a DTX is scheduled for BSn on all subbands 
of subframe t (an{t) = 0) or not (an{t) = 1), and A is the 


set of all interference-free (i.e., feasible) DTX patterns . Note 
that we may drop the index t when there is no ambiguity. 
In each subframe, the pattern of transmitting BSs is chosen 
by DTX control policy from the set of feasible DTX patterns 
which guarantees that no interference would be caused in the 
HetNet. 

As the optimal DTX control policy depends on the HetNet 
topology graph, we control the DTX dynamically with respect 
to the large scale fading. Therefore, DTX control is categorized 
as long-term control variables. 

Definition 3. (DTX Control Policy) A DTX control policy 0 = 
{A, q} consists of a set of DTX patterns A = {ai,..., a\A\} 
with their associated time sharing q = [gi,..., q\A\Y‘ 
superframe, the DTX control policy determines that each the 
DTX pattern aj will be used in qj percentage of the subframes. 
The set of feasible DTX control policies is defined as 

A0 = |0 = {A,q}: (3) 

l>i| 

A C Vi = 1,..., |A|: 0<qi<l,y2Qi = A- 

i=l ^ 

b) Short-term Control Variables 

4) Dynamic Link Scheduling: As mentioned before, there 
are M available subbands for transmission by BSs. If the 
DTX pattern on the t-th subframe satisfies an{t) = 1, BSn 
is allowed to transmit on the (t,m)-th RB, Vm. To avoid 
interference on each subband m, only one of the associated 
links of BSn is scheduled for transmission on the (t, m)-th 
RB. 

Let pi^ni G {0,1} be the link scheduling variable of link I 
over subband m and pi = , Pi,mY associated 

subband selection vector for link 1. In each subframe, this 
variable determines which subbands are used for transmission 
over link I and is decided locally by its head BS to exploit 
multi-user diversity as well as to avoid inter-link interference. 
For example, if the head BS of link I decides to use the 
subbands mi and m 2 for transmitting over link /, we would 
have pi^rm = 1 and = 1- 

Let p = [pi,..., denote the overall link scheduling 
vector for all the links in the network. The set of all feasible 
link scheduling vectors, under the DTX pattern a, is given by: 

Apia) = ^p=[pi,...,PLf : (4) 

V/ : pi = [pz,i,... ,Pi,mY, 

Vm . p/ 772 G {0, Ij- , ^ ^ Pl,m — 5 VtT/ G Xf j” . 

ier{n) 

The intuition behind Ap{a) is elaborated below. First, each 
pi^ni is a binary integer variable. If the subband is used 
on link /, then pi^ni = 1- Otherwise p^,m = 0 indicates that the 

^th 

subband is not scheduled on link 1. Moreover, if a DTX 
is scheduled for a BS (i.e., = 0), none of its outgoing links 

can use any subband m for transmission. Finally, as both 
and pi^ni are binary, the last constraint implies that a subband 
can be used at most in one of the outgoing links of each BSn^ 









5 


Long-term control variables 

Flow control d 

Routing control x 

DTX control © 

Short-term control variables 

Link scheduling p 


TABLE I: Summary of the long-term and short-term control 
variables. 


This guarantees that there would be no interference among the 
adjacent links as well. 

In each subframe, the link scheduling policy, selects a fea¬ 
sible link scheduling vector from the aforementioned feasible 
set, under the current DTX pattern. 

Definition 4. (Link Scheduling Policy) A link scheduling 
policy is a function p : Ax PL ^ ^p{cL), where A and PL are 
the DTX pattern space and the CSI space, respectively. In other 
words, under the DTX pattern a and the channel realization 
H, the subband(s) that are used by link I is determined by 
Pi = Pi{a, H) = pi,mV, V; = 1, ■ • ■, L. 

For compactness, the control policies at the physical layer 
are summarized by a physical layer (PHY) control policy. 


Definition 5. (PHY Control Policy) We define the physical 
layer control policy as the combination of the DTX control 
policy and link scheduling control policy: Cl = {0,p}, and 
its feasible set. A, is determined by: 


A = |(0, p) 0 = {A, q) e Ao, p e Ap(a) : Va e a| . 


(5) 


Table I summarizes the long-term and short-term control 
variables of our model. 


III. Two Timescale Hierarchical Radio Resource 
Management Problem Formulation 


For given DTX pattern a, link scheduling policy p and 
large-scale channel fading state the 

conditional average data rate carried on link I is given by: 


M 


E pprn{a,H) log(l + 


m=l 


H 


large 


( 6 ) 


where pi is the transmission power on each subband over link 
/, and is given by the following equation. 

_ / Pmacro if head node of link I is a macro BS, 

1 Ppico if the head node of link I is a pico BS. 

(7) 

where Pmacro and Ppico are the the power levels of macro BS 
and pico BS, respectively, on each subband. Since the coverage 
area of the macro BS is greater than the coverage area of a 
pico BS, it is obvious that Pmacro > Ppico- 

Note that because of the DTX pattern and link scheduling 
policies, there is no interference in the network. 

Therefore, for a given PHY control policy = {0, p}, with 
DTX control policy 0 = {A, q} and link scheduling policy 


p, the average conditional data rate over link I is: 

|A| 

= y2QMaj,p\H^^^n- (8) 

i=i 

For the sake of simplicity, from now on we may use 
fi (Cl) as an abbreviated notation for and define 

r{Q.) = [fi 

The performance of the network is measured by a utility 
function U{d), where d = [di,..., is the average data 
flow vector, as mentioned before. We make the following 
assumption on U{d). 

Assumption 1. (Utility Function) The utility function is ex¬ 
pressed as U{d) = Yl^=i^k{dk)^ where Uk is a twice 
continuously differentiable, strictly concave and increasing 
function of the average data flow rate dk^ \/k = 1,... ,K. 


Note that the above assumptions on the utility function 
can capture a lot of interesting cases such as a-faimess and 
proportional fairness: 

• Alpha-fair: This type of utility function can be used to 
compromise between the fairness to the users and the 
utilization of the resources. In this case, each function 
Uk{dk), V/c is defined as: 


Uk{dk) = 


log(4 + e), a = l, 

(1 — a)~^{dk + Otherwise. 

(9) 


where e > 0 is a small number. 

• Proportional Fair (PFS): This type of utility function 
is a special case of alpha-fair when a = 1 and is 
well-known for supporting high resource utilization while 
maintaining good fairness among network flows. 

For a given HetNet topology graph Q = {A/", C} and known 
source-destination pairs, the hierarchical RRM optimization 
problem can be formulated as following 


K 

P^org - max ^ ^ Ukiydk) (Ifi) 

k=l 


subject to: 

Gxk = Vk{dk), \/k = 1,...,K 

K 

v; = i,...,L 

k=l 


$7 e A. 


( 11 ) 

( 12 ) 

(13) 


Constraint (11) is the fiow conservation constraint, in which 
G is the node-link incidence matrix of the HetNet and 
'^k{dk) G is a vector that specifies the amount of net 
outgoing data rate for dk in each node. Hence, Vk{dk) = 
[vk,i ,..., is defined as follows: 

{ dk if n is the source node of fiow k, 
—dk if n is the destination node of fiow k, 

0 otherwise. 

(14) 

Furthermore, constraint (12) is the link capacity constraint 
which denotes the physical layer constraint on the average data 
rate of each link. Finally, constraint (13) guarantees that the 
chosen physical layer control policy be feasible. 
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IV. Problem Transformation and Decomposition 

According to (8), the conditional average data rate r = 
[ri,..., involves stochastic expectation over CSI realiza¬ 
tions and does not have closed-form expression. This implies 
that Porg is a stochastic optimization problem. However, since 
the problem includes control variables with combinatorial 
structure (i.e., DTX control policy), the common stochastic 
optimization techniques such as stochastic subgradient and 
stochastic cutting plane cannot be applied in this problem. 

On the other hand, due to the existence of combinatorial 
variables (i.e., P/,V/) in the capacity constraint (12), the 
feasibility region of the problem is not convex. Consequently, 
the overall problem is not convex and hence, the conventional 
convex optimization techniques cannot be used to solve it. 

In the following, we will show how to tackle with this 
challenge by transforming the original problem into a form 
which can exploit a hidden convexity. 

Note that according to the primal decomposition [?], the 
original problem can be decomposed into two sub-problems 
Vi and V 2 , as follows. 

Subproblem 1 (Optimization of the routing control d and 
flow control x under fixed physical layer control policy ft): 


K 


Pi; U{r{Q)) = 

s.t: 

maxX^ Uk{dk) 

(i,x ' 

/c=l 

(15) 

Gxk = Vk{dk), 

K 

II 

> 

(16) 

k=l 


(17) 


Subproblem 2 (Optimization of physical layer control 
policy Q): 

V 2 : maxf/(r(f^)). (18) 

For any physical layer policy ft or equivalently any data 
rate vector r, problem Vi is a standard convex optimization 
problem. Hence, using existing convex optimization methods, 
it can be easily solved in the RRMS at the beginning of each 
superframe in order to obtain the routing vector x and data 
flow rate vector d for that superframe. 

Remark 1. (Solving Problem Vi) Note that for solving 
problem Vi, we specifically use primal-dual methods. Con¬ 
sequently, by solving problem Vi, we will simultaneously 
obtain the Lagrangian multipliers or dual variables A = 
[Ai,..., \l]^ associated to constraints (17) as well. 

Using the conditions in Assumption 1, [?, Proposition 3.3.3] 
shows that the Lagrangian multiplier vector X is actually the 
gradient vector ofU (r), i.e., X = VU (r). 

On the other hand, it is very difficult to find the solution 
for problem V 2 . First, there is no closed form expression 
for the utility function U of problem V 2 because U is the 
solution of another constrained optimization problem (i.e., 
problem Vi). Moreover, the conditional average data rates 
r/, V/ in the objective function Uir) involve stochastic expec¬ 
tation over CSI realizations and fi is a non-convex function 


of ft which includes combinatorial control variables p and 
A. As a result, V 2 is a non-convex stochastic optimization 
problem with combinatorial optimization variables and hence, 
the conventional stochastic optimization techniques (such as 
stochastic subgradient and stochastic cutting plane) cannot 
be applied for solving it. In the next section, we focus on 
addressing the following challenge. 


Challenge 1. Exploit the specific structure of problem 
V 2 to find a global optimal solution for this non-convex 
stochastic optimization problem whose objective has no 
closed-form expression and whose optimization variables 
contain combinatorial variables. 


V. Solution to Problem V 2 

As elaborated before, problem V 2 is a non-convex problem 
which has no closed form expression neither for the objective 
function or the constraints. The first challenge makes it very 
difficult to find a necessary and sufficient global optimality 
condition of this problem. In this section we aim to address 
the first challenge. 

In this section, we first study the hidden convexity of V 2 
which can then be exploited to tackle with the first challenge 
and derive the global optimality condition for this problem. 
Next, based on the global optimality condition, we propose 
an iterative algorithm to efficiently solve the problem. 

A. Hidden Convexity and Global Optimality Condition of V 2 
The average data rate region can be defined as: 

1Z= 1^ [u e \ u < r(f^)} , (19) 

uga 

where r{fl) = [ri(f^),..., in which rfifl) = 

|A| 

^ Qjvfiaj^ p)^ V/ = 1,..., L, as earlier defined in (8). 
i=i 

Using this definition. Theorem 1 shows the relationship 
between problem V 2 and the following optimization problem: 

Ve ‘ maxf/(r). (20) 

Theorem 1. Suppose that fP is the global optimal solution of 
V 2 , then r(fP) is the optimal solution ofVEt cind ifr"" is the 
optimal solution ofVE, then any U* satisfying r(U*) = r* is 
the global optimal solution ofV 2 . 

Proof: Refer to Appendix A. 

The following Theorem shows that the equivalent problem 
Ve is 3 . convex problem: 

Theorem 2. (Convexity of Problem Ve) in problem Ve, the 
objective function U (r) is concave and the feasible set IZ is 
a convex set. Hence, Ve is a convex optimization problem. 

Proof: The detailed proof has been provided in Appendix 

B. 

It should be noted that although problem 7^^; is convex, it 
is not trivial to find its solution because its feasible set 1Z still 
does not have any simple characterization. 
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The convexity of the equivalent problem Ve , results in 
a hidden convexity in the original problem V 2 which will 
be utilized in order to tackle with the first aforementioned 
challenge. For this purpose, we first express the first order 
optimality condition of problem Ve as summarized in the 
following lemma. Using this Lemma, Theorem 3 then derives 
a sufficient global optimality condition for problem V 2 - 


Lemma 1. (First Order Optimality Condition of Problem Ve) 
The vector r* = is the optimal solution for 

problem Ve If 



r) > 0, \/r eV 


( 21 ) 


where gp (r) is the gradient of U at point r. 

Proof: Refer to Appendix C. 

Using Theorem 1 and Lemma 1, Theorem 3 states a 
sufficient condition for global optimality for problem V 2 as 
follows. 


Theorem 3. (Global Optimality Condition ofV 2 ) A physical 
layer control policy U* = where 0* = 

is a global optimal solution of V 2 if it satisfies the following 
condition: 


Vj = 1,..., |A*|, Va G A,'ip G Kp (a) : 



(r (a*,p*) -r(a,p)) > 0 


( 22 ) 


where (r) is the gradient of U at point r = 


Proof: The detailed proof can be found in Appendix D. 


B. Globally Optimal Solution of V 2 

Although Theorem 3 introduces a sufficient condition for 
global optimal solution of problem V 2 , achieving a solution 
which satisfies this condition is still non-trivial and challeng¬ 
ing. Therefore, in order to find the optimal solution of the 
problem, the following challenge needs to be addressed. 


Challenge 2. Find an algorithm to achieve a solution that 
satisfies the global optimality condition in Theorem 3. 


In the rest of this section, using the global optimality 
condition in Theorem 3, we propose an algorithm that it¬ 
eratively updates the optimization variables A, q, p and a 
weight vector u) = [cci,...,in a way that the globally 
optimal solution of V 2 is achieved. In the following, we first 
introduce our proposed algorithm, then we prove its properties 
and specifically, its global optimality. 

The pseudo code of the proposed algorithm is summarized 
in Algorithm 1. The indexes t and i are indicators for referring 
to a subframe and a superframe, respectively. At each subframe 
t G [(i — l)Ts +1, iTs] of the superframe, each BS chooses 
a DTX pattern from the set of DTX patterns , based on the 
time-sharing vector of the current superframe. It should 
be noted that the DTX patterns are generated at each BS using 


the same pseudo random generator and thus the DTX pattern 
generated at different BSs are the same for each subframe. 

For updating the long-term control variables, at each su¬ 
perframe 7, first, using Procedure I, a new DTX pattern a is 
obtained and added to the set of all previously found DTX 
patterns, i.e. A. Then, the optimum time sharing among them 
is found by Procedure 11. Finally, routing vector and fiow 
control vector are updated by solving Problem Vi, under 
current physical layer control policy U using the primal-dual 
method, which also finds the optimal Lagrangian multipliers 
A of 7^1. A is the gradient of the objective function U{r) 
at r (U) and it is input to Procedure I for the next iteration. 
These steps are done iteratively until convergence which will 
be checked by the termination condition. 

As mentioned above. Algorithm 1 includes two other proce¬ 
dures, named Procedure I and II. In the following, first we will 
elaborate these procedures, and then show the convergence of 
the the algorithm. 

1 ) Procedure I (Finding a new DTX pattern a) : For given 

cj. Procedure I finds a control variable that satisfies 

the global optimality condition in (22). Specifically, at the 
beginning of each superframe, RRMS broadcasts the value 
of of the current superframe to all BSs. Then, using 
this coefficients along with local CSI, each BSn calculates 
and broadcasts the average rates of each associated link 
conditioned on all admissible DTX patterns, based on the 
following equation: 

ia e A,il e r{n) : (24) 

M 

ri {a, p* {a,u)) = E (a, w) log(l + V)} 

m=l 

where p* (u?) is the link scheduling policy described in (23). 

Finally, at the end of each superframe, RRMS calls Proce¬ 
dure I to find a new DTX pattern that maximizes the overall 
weighted sum rate of the network based on the feedbacked 
conditional average rates from the BSs. 

Remark 2. The expectation term in (24) is over short- 
timescale CSI, and can be calculated by each BS knowing 
the distribution of short-timescale CSI. For the implementation 
when the distribution of the local channels are not available 
to BSs, this expectation term can be calculated using the 
running sample average over all subframes in each super- 
frame. Moreover, the calculation of (24) in each BSn does 
not require any message passing between BSs, and only local 
CSI information about the CSI of the outgoing channels from 
BSn (k^-y V/ G T{n)) is sufficient. 

2) Procedure II (Update of time sharing vector q ) : 

Having the set of chosen DTX control variables = 

j = 1, • • •, |A*^*^||, along with the associated data rate 

of links n , V/, which are feedbacked 

from the head BSs of the links to the RRMS, procedure II 
obtains the updated time sharing vector q^'^^ by solving the 
following problem for fixed A^^\ 

max U (r) (25) 






Algorithm 1 Iterative Algorithm for Solving V 2 

1: Initialization: 

2: Set i = 1, 

and let = {ai G A} and = 1. 

3: Choose proper initial routing control and flow 

control >0 such that constraint (16) is satisfled. 
Let = [1 1 ••• 1 ]^ . 

4: i i + 1. 

5: Step 1 (Short timescale link scheduling optimization 
at each BS in each subframe t e [{i — l)Ts + l^iTs]): 
6: Each BS generates a DTX pattern a according to the 

current time sharing vector q^'^\ 

7: For given of that subframe, the optimum 

link scheduling policy p* (a, is obtained by: 


At every BSn : pi* 


i — CL 

(i)- 


(*) 


(23) 


where = arg max loP log(l + 


leT(n) 


l,m I 


Step 2 (Long term DTX control optimization at RRMS 
starting from subframe {iTg —T^) in superframe i): 

to obtain a new control 


9: 

Step 2a: 



10: 

Call Procedure I with input 



variable 

a*(a)W). 


11: 

Let 

{a*(a,«)}U{af 



A« = 

-1) 

12: 

Step 2b: 



13: 

Call Procedure II with input 




to 


sharing 

vector q^^^ = [qi\ 



G > 0 




to obtain the updated time 




(0 

A(^)| 


14: Step 2c: 

15: Let ©W = and = {©(*),where 

the link scheduling policy is determined by (23). 
16: Solve problem Vi with input f which can be 

calculated using the conditional average rates 
feedbacked from BSs to the RRMS, and obtain the 
updated the routing vector the flow control vector 
and the Lagrangian multipliers 
17: Update the weight vector: 

(jj 






18: Termination 


19 : 


If I[/((!(*)) - > e, where e > 0 is a small 

number, then the algorithm is terminated with the 
optimum physical layer control policy U* = {0*,p*} 
where 0* = {A*,gf*}, A* = > o| and 

1. Otherwise, return to line 4. 


Q = 


qf>0 


Algorithm 2 Procedure I: Finding a new DTX pattern a* 
1: Input \fa e A: ri (a, p* (a, u;)), V/. 

2: for all a G A do 

L 

3: Let R{a, 01 )=^^ unri (a, p* (a, w)). 

/=! 

4: end for 

5: Output a*(u;) = arg max Ria^uj). 

aeA 


subject to: 


gjG[ 0,l], Vj, ^qj = l 

j=i 

r = [ri,...r if, where ri = qjri \^ap\ . 

7 = 1 


It can be easily verifled that problem (25) is a deterministic 
convex optimization. Therefore, it can be efficiently solved 
in the RRMS using existing convex optimization methods as 
well. 

Figure 4 illustrates the overall relationship among the main 
problem and the subproblems, as well as the feedback signals 
between BSs and the RRMS. 

3) Optimality and Convergence of Algorithm 1: In this 
section, we show that Algorithm 1 converges to the global 
optimal solution of 7 ^ 2 - In particular, we show that this 
algorithm updates the control variables in a way that the 
global optimality condition in Theorem 3 is satisfled. For this 
purpose, we first state the following Theorem about Procedure 
I. 


Theorem 4. (Property of Procedure I) For given input ca, the 
output a* (uj) of Procedure I satisfies 

Va G A, p G Ap (a) : 

• (r (a* (cj), p* (a* (cj), cj)) - r (a, p)'j >0. (26) 

Proof: The detailed proof can be found in Appendix E. 
Using Theorem 4, we prove the convergence of Algorithm 
1 and show its maximum optimality gap in general case, via 
the following Theorem. 

Theorem 5. (Global Optimality of Algorithm 1) Let he 
the output of the iteration of Algorithm 1. We have 

\imU(n^A=U*, (27) 

i^OO \ J 

where U* is the global optimal value ofV 2 - 

Proof: The detailed proof can be found in Appendix F. 


VI. Implementation Consideration and Signalling 

Flow 

According to our explanations for the proposed algorithm 
in the previous sections, the signalling flow between RRMS, 
macro/pico BS and MUs within each superframe or each 
subframe has been interpreted in Figs. 4-5. 
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RRMS 


macro/pico 



Fig. 4: Signalling flow at each superframe (slow signalling). 


macro/pico 

BS^ 



Fig. 5: Signalling flow at each subframe (fast signalling). 


Fig. 4 shows the slow timescale signalling flows that are 
being passed between RRMS and each BS at the end of each 
superframe. As can be seen in this figure, at the end of the 
super frame: 

1) Each BSn sends its associated row of the node-link 
incident matrix (i.e., Gn^h V/) to the RRMS so that 
the RRMS can have the updated information about the 
topology of the network. Moreover, BSn calculates the 
average rates of each of its associated links conditioned 
on all admissible DTX patterns r/ (a, p* (u;)), Va G ^ 
and send them to RRMS, for further central calculation 
and decision makings. 

2) Then, RRMS updates the long-term control variables 
based on these average rates. 

3) Finally, RRMS broadcasts the long-term control vari¬ 
ables and the weight vector uj to the BSs that will be 
used in the (i + 1 )^^ super frame. 

It should be noted that we allow subframes before the 
end of each superframe for RRMS to start to do all the above 
long timescale calculations and signalling. The parameter 
is chosen as a sufficient time for calculations and message 
passings. This timing configuration guarantees that our design 
is robust to the backhaul latency. 

Fig. 5 shows the fast timescale message passing between 
the head and the tail of each link, at each subframe. It is clear 
that the head node of each link is always a BS (macro or pico), 
while the tail node can be either a BS or a MU, as well. 



(meters) 


Fig. 6 : Topology of a multi-cell HetNet. The source nodes 
and the destinations nodes has been labelled as 5 i, • • •, 59 and 
di, • • •, dg, respectively. Moreover, Vi = 1, • • •, 9, 

indicates each source-destination pair. 


As can be comprehended from these figures, the proposed 
hierarchical RRM has several advantages. First, the RRMS 
requires only global statistical information which can be 
provided and feedbacked by BSs in large timescale signalling. 
Hence, the proposed solution is robust w.r.t. backhaul latency. 
On the other hand, the BSs and their associated MUs com¬ 
municate locally with each other in the short timescale, while 
each BS only required local CSI of its outgoing direct links, 
which can be feedbacked by the receiving node (receiving 
BS or MU). Therefore, our proposed algorithm has low sig¬ 
nalling overhead and message passing among different types 
of network components as well as good scalability on the 
complexity. Hence, it can be efficiently implemented in real 
applications and networks. 

VH. Simulation Results 

In this section, we consider a multi-cell HetNet with flexible 
backhaul, as shown in Fig. 6 . Each red box corresponds to 
a macro cell. In the macro cell, there is a macro BS and 
3 uniformly distributed pico BSs (constituting pico cells) 
and 3 active MUs. Black nodes, green nodes and small red 
nodes stand for macro BSs, pico BSs and MUs, respectively. 
Moreover, existing of a line between two nodes in this graph, 
shows that those nodes are in the coverage area of each other, 
i.e., they can communicate/interfere with each other. The solid 
lines show the connection links between two BSs, while the 
dashed lines show the links between BS and MU. The transmit 
power of macro BS and pico BSs are pmacro = 40 dBm 
and Ppico = 29^35 dBm, respectively. There are 10 available 
subbands in the network, and 3 data flows are to be routed 
from cell 1/2/3 to cell 2/3/1, i.e., from some BSs connected 
to the backhaul to some destined users. The PFS utility (as in 
equation (9)) is considered. 

The path-loss model for macro BS-macro BS links is based 
on the free space model in [?]. Moreover, for the pico BS-pico 
BS links and BS-MU links, the pass loss models of b5a sce¬ 
nario and c2 scenario in [?] have been used, respectively. For 
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Fig. 8: Average per-cell sum throughput comparison 


the small scale fading, block fading channel model has been 
considered, in which the small scale fading is static during 
one subframe and i.i.d. over different subframes. Moreover, 
as mentioned before, the large scale fading is considered to 
be a slow ergodic process which remains constant for several 
superframes. 


A. Performance Evaluation and Comparison 

We compare the performance of the proposed RRM scheme 
with the following baselines using numerical simulations: 

• Baseline 1, Full Backhaul Connection (FBC): Each 
pico BS is connected to the associated Macro BS in each 
cell. The RRM server adopted our proposed Algorithm 1 
to allocate the resource. 

• Baseline 2, Fixed DTX and Dynamic Subband Alloca¬ 
tion (FDDSA) [?]: The RRM server selects DTX actions 

with equal probabilities (i.e., qj = —j-, Vj = 1, • • •, |^|) 

\A.\ 

at the beginning of each frame, and the subband allocation 
is determined by the proposed link scheduling scheme in 
(23). 

• Baseline 3, One-timescale RRM adapting to Statistical 
CSI (TTRSC) [?]: The RRM server solves a problem as 
in Porg with the link scheduling policy depending on the 
large scale fading gain instead of depending on the small 
scale fading gain. 

Fig. 7 compares the proposed algorithm with these baselines 
and show the utility function versus the transmit power of 
pico BSs. We have also compared the performance of the 
proposed algorithm with the baselines in terms of the sum 
rate throughput, as shown in Fig. 8. As can be seen from 
these figure, the performance of the proposed scheme is very 
close to Baseline 1 (which has full backhaul connectivity) 
with very little performance loss due to the flexible backhaul 
design. Moreover, our proposed solution outperforms all the 
other baselines with a large gap. 


B. Complexity Comparison 

Table II illustrates the comparison of the MATLAB com¬ 
putational time of our proposed scheme and the different 


No. of Cells 

2 

3 

4 

Baseline 1 

13.235ms 

38.201ms 

80.792ms 

Baseline 2 

7.275ms 

25.037ms 

60.211ms 

Baseline 3 

11.189ms 

36.522ms 

75.314ms 

Proposed Schemes 

11.021ms 

36.772ms 

77.652ms 


TABLE II: Comparison of the MATLAB computational time 
of the baselines and the proposed algorithm in one decision 
slot. 


baselines. As it is expected, the computational time of Baseline 
2 (FDDSA) is the smallest in all different scenarios, since it 
does not have dynamic DTX control. Moreover, the compu¬ 
tational cost of our proposed schemes is very closes to those 
of Baseline 3. However, our proposed schemes outperforms 
baseline 1. These results verify that our proposed scheme 
succeeds in utilization of the local instantaneous information 
which is available at each BS with a low computational 
complexity. 


C. Convergence of the Proposed Algorithm 

Fig. 9 shows the objective value U {Ci) of V 2 versus the 
number of iterations. As can be verified from this figure, the 
proposed Algorithm converges very fast. 

VIII. Conclusion 

In this paper, we proposed a two-timescale hierarchical 
RRM for HetNets with flexible backhaul. We formulate the 
problem with a two timescale stochastic optimization prob¬ 
lem and considered the cross-layer design of fiow control, 
routing control, interference mitigation and link scheduling 
in the heterogeneous network with fiexible backhaul. Stating 
a sufficient condition for the optimal solution, we proposed 
an iterative algorithm to solve the problem. The proposed 
solution is optimal with low complexity. Moreover, due to 
the proposed two-timescale control design, it benefits from 
low signalling overhead and is robust to the backhaul latency 
as well. Simulations show that the proposed RRM design 
converges fast and it achieves significant performance gain 
over various baselines. 
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Fig. 9: Convergence result (Ppico = 30 dBm) 


Appendix 

A. Proof of Theorem 1 

If is the global optimal solution of V 2 , then from the 
definition of problems Ve and V 2 it follows that r{VP) would 
be the optimal solution of On the other hand, suppose that 
r* is the optimal solution IoVe, and fP satisfies = r*. 

By contradiction, if is not the global optimal solution of 
V 2 , then them exists another control policy O G A such that 
U{r{Q)) > As a consequent, r(r^*) = r* would 

not be the optimal solution of problem Ve, and this contradicts 
with our first assumption. 


B. Proof of Theorem 2 

In order to prove the convexity^ of problem Ve, we first 
prove that its objective function U{r) is concave. Then, we 
prove that its feasible set is convex. 

For the first part, it is sufficient to show that —U{r) 
is convex. Note that a function is convex, if and only if 
its epigraph is a convex set [?]. The epigraph of function 
—U{r) can be defined sls = [1] : r e y > —f/(r)|. 
Equivalently, it is sufficient to prove that the following set is 
convex: 

epiU ren, y< C/(r)} . (28) 


Lemma 2. The set epiU is a convex set. 

Proof: 

First note that at any arbitrary r ^ IZ, the Lagrangian 
multipliers vector A (r) would be the gradient of function 
U [?, Proposition 3.3.3] at point r. Hence, the {L + 1)- 


dimensional vector 


A(r) 

-1 


hyperplane to epiU at any point z = [fj^r) 
i.e., 


defines a non-vertical supporting 

Wen [?]; 


WeU: 

(r' — r)^ • A (r) + (^U (r) — > 0, V 2 ;' = 



(29) 
G epiU. 


We prove Lemma 2 by contradiction: Lets assume that epiU 
is NOT a convex set. This implies that there exists some Z\ = 
[yW 1^2 = [ ^2 ] ^ epiU and some 0 < a < 1, such that 

= [ yo ] - + (1 - a) - 2:2 ^ epiU- (30) 

According to the definition of epiU, and r 2 belong to the 
set n, and since it is a convex set, ar\ -\- {1 — a) r 2 G epiU 
as well. This along with (30) implies that 


yo>U (ro). 


Now consider point z = 


_ro 

U{ro) 


(31) 

The supporting hyper¬ 


plane of epiU at this point, so called , would be as 

follows. 


SHPl^u ^ 


A 

/ ' 
V 

V = 

[y'J 




(r' - rof ■ A (ro) + {u (ro) - y') = o|. (32) 


Substituting r = Vq in (29), along with (31) and (32) 
concludes that Zq and any z G epiU are not located at the 
same side of the aforementioned hyperplane. In other words, 
we have 


(r - ro)^ ■ A(ro) + ( 2/0 - 2 /) > 0, Vz = [y] e epiU. (33) 

Substituting z = Zi^Z 2 G epiU in (33) and noting that 
ro = Q^ri + (l — a)r 2 , the following two inequalities are 
concluded 


(1 - a) (ri - r 2 f ■ A (ro) + (t/o - Vi) > 0, (34) 


-a (ri - r 2 f • A (ro) + (yo - ^ 2 ) > 0. (35) 

Now we multiply (34) and (35) by a and (1 — a), respec¬ 
tively and then sum up the obtained inequalities together. It 
follows that 


yo > oiyi + (1 - a) y 2 (36) 

where, according to (30), the right hand side expression is 
equal to y^. This is obviously a contradiction and therefore 
Lemma 2 is proven. Consequently, the epigraph of —U is 
convex, and therefore, the objective function U (r) of the 
optimization problem Ve is a concave function as well. ■ 

Next, as in the second part of Theorem 2, we prove that the 
feasible set of 7^^; is convex, too. 

Lemma 3. IZ is a convex set. 

Proof: 

In order to prove the convexity of TZ, we show that IZ is 
the convex hull of the following set, i.e., IZ = Conv (IZ'). 

IZ' = (37) 

|r (a,p) = [ri {a,p),...,rL ia,p)f: aGA,pGAp (a)|. 

It is obvious that IZ C Conv (IZ'). Hence, it is sufficient to 
show that Conv (IZ') C IZ, too. According to the definition of 
IZ in (19), if any point r lies in IZ, then every point ly G 
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where u <r lies in this region as well. Therefore, in order to 
prove Conv {IZ') C 71, it is sufficient to show that any Pareto 
boundary point r' of Conv {TV) lies in 71. 

As r' is a L-dimensional Pareto boundary point, it can 
be expressed as a convex combination of L + 1 points in 
W, i.e., r' = Y.^llqjr{aj,p), where Y7j=l 
qj G [0,1], p G Kp and aj G Vj. Furthermore, as 
r' is a Pareto boundary point of Conv {71'), it follows that 
the point r {aj^ p) ^ \/p^Mj \ > 0 lies in the supporting 

hyperplane of Conv {71') at the Pareto boundary point r'. This 
fact implies that we can express r' as a convex combination 
of L points in the set {r (ai, p)p)}. Hence, 
r' = (a',p) where Ej=i 9^ = q'j ^ [ 0 > 1 ]> 

p e Ap and a' G {a^,..., a^} , Vj = 1,..., L. Conse¬ 
quently, r' lies in 71 which completes the proof of lemma 3. 
■ 

Having Lemma 2 and Lemma 3, the proof of Theorem 2 is 
completed. 


abbreviation, we will have: 


9lir) 


r=r{0*) 




1 = 1 

i=i 1=1 
\A*\ 

= 9] 9o (r) 

i=i 
|A*| 

= 9] Qfj (r) 

i=i 


r=r(n* 


r=r(n* 


• r 


[aj,p ) 


r{al,p*) 


= 9u (0 


r=r(n*) 


.r(a^p*) (42) 


where the last equation is resulted from the fact that 



Now, combining (40) and (42), the following inequality is 
easily obtained. 


C. Proof of Lemma 1 

Suppose r* = satisfies (21) and define a 

function (r) = f/ (r*) + g7 (r*) • (r — r*) over the domain 
71. Using the fact that g7 {r*) is the gradient of the concave 
function U (r) at point r*, along with the inequality in (21), 
it is easily to see that 

^{r)>U (r), Vr G 7^. (38) 

Moreover, we have: 

\I/ (r*) = ^ = U (F*) + gl (F*) • (F* - F*) 

= C/(F*) > 5'(F), VFG 7^. (39) 




(r) 


r=r{0*) 


(r (U*) -r) > 0, Vr G 7^. 


(43) 


Using Lemma 1, it follows that r(U*) is the optimal 
solution of problem Ve- Consequently, by Theorem 1, U* is 
the global optimal solution of 7 ^ 2 - This completes the proof. 


E. Proof of Theorem 4 

It is easy to see that for a given input uj. Procedure I outputs 
the optimal solution of the following optimization problem: 

max • r (a, p* (a, u;)). (44) 

aeA 

Suppose that a* {uj) is the solution of the above problem. 
Hence, it follows that 


where the last inequality is clearly resulted from ( 21 ). 

^ Combining (38) and (39), it is easily resulted that U (r*) > 
U (r), Vr G 7^. Hence, r* is the global optimal solution for 
problem Ve- 


Va G p G Ap (a) : 

• r {a* {U)), p* (a* (w), w)) > • r (a, p), (45) 

which completes the proof. 


D. Proof of Theorem 3 


Suppose U* = {0*,p*} with 0* = {A*,q*}, A* 

'I r -\7 


satisfies condi- 


{aJ,...,aE.|} and q* = [(/jf,..., , 

tion (22) in Theorem 3. It can be easily followed from (22) 
along with the definition of 7^ in (19) that: 


9~ 


(r) 


r=r{Q .*) 


• (r (a^, p*) - r) > 0, Vr G 71, (40) 


and 


Vj = l,...,|A*|: 


afr (.r) 


r=r(n’ 


■r{a*,p*) = gl (r) 


r=r(n*) 


(41) 

r{alp*). 


Using (41) and the fact that r; ($7*) = E^li' (oj,P*). 
and defining the element of gjj as gf for 


F. Proof of Theorem 5 

First, we express the following Lemma which elaborates a 
property of Algorithm 1. 

Lemma 4. (Property of Algorithm 1) Assume that is the 
physical layer control policy obtained in the iteration of 
Algorithm 1 and is the weight vector obtained in Step 

3 of this iteration. We have 

U > max^ U 1^(1 -a)f + 

(46) 

Proof: First, according to Algorithm 1, U 
is the optimal value of problem (25) with A = 
{a* U G > o|. Second, it is easy 
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to see that q = (1 — o^) ^ , Va G [0,1] is a feasible 

solution to problem (25), where > 0, Vj|. 

Using these two facts, it is easily followed that the equation 
(46) holds, and hence. Lemma 4 is proved. ■ 

By Lemma 4, it is concluded that U > 

U (r = U which implies that Algorithm 1 is 

increasing. This, along with the fact that the objective value 
is upper bounded concludes that this algorithm is convergent, 
i.e., 

,lim =f7,V, (47) 

where is the convergence value of Algorithm 1. 

Now, let a* and U* be any limiting points of sequences 
{a* } and } generated by Algorithm 1. From (26) 

we have: 

• (r (a*) -r*) > 0 (48) 

where is the gradient of U at the point r (U*) and r* is 
the global optimal solution to problem 7^2- Hence, 

+ 9*^- (.r (a*) - f (17*)) > + g*^- (r* - r (17*)) 

> U*. (49) 

where, the last inequality is due to the fact that is 
the gradient of the concave function U at the point f{Ct*). 
Consequently, we have 

g*T . ^ > 0 ( 50 ) 

Now, it is sufficient to show that '{r (a*) — f (U*)) = 0 
which results that the convergence value of Algorithm 1 is 
^ual to the global optimal solution of the problem, i.e., 
U^igi = U*. In the following, we will prove it by contra¬ 
diction. 

Assume that g*^ • (r (a^^) — r (U*)) > 0. Hence, from (50) 
it is concluded that there exist a subsequence of the iterations 
of Algorithm 1 such that g*^ • (r (a*) — r (U*)) > e, where e 
is some positive real number. Hence, using Taylor expansion 
at point f (U*), we have: 

U (f (17*) +T(r (a*) - f (17*)) ) = 

U (r (17*)) + rg*^ • (r (a*) - f (17*)) + o (r^) (51) 

for some sufficiently small r > 0. This, along with the 
contradiction assumption, results in: 

U (r (U*) + r (r (a*) - f (U*)) ) > U (r (U*)) (52) 

Therefore, we have found some point r (U*) -h r (a*) — 

r (U*)J with strictly larger utility value than the convergence 

value Uaigi^ which contradicts with with monotonic conver¬ 
gence property of Algorithm 1. Accordingly, the contradiction 
assumption is wrong and hence, g*^ • (r* (a*) — r(U*)) = 
^ This equation along with inequality (50) conclude that 
^aigi ~ which completes the proof of global optimality 
of Algorithm 1. 



